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^ We consider a small extent sensor network for event detection, in which nodes take samples 

periodically and then contend over a random access network to transmit their measurement packets 
to the fusion center. We consider two procedures at the fusion center to process the measurements. 
' The Bayesian setting is assumed; i.e., the fusion center has a prior distribution on the change 

^■f^ ^ time. In the first procedure, the decision algorithm at the fusion center is network-oblivious and 

makes a decision only when a complete vector of measurements taken at a sampling instant is 
available. In the second procedure, the decision algorithm at the fusion center is network-aware 
and processes measurements as they arrive, but in a time causal order. In this case, the decision 
statistic depends on the network delays as well, whereas in the network-oblivious case, the decision 
, statistic does not depend on the network delays. This yields a Bayesian change detection problem 

^ ' with a tradeoff between the random network delay and the decision delay; a higher sampling rate 

reduces the decision delay but increases the random access delay. Under periodic sampling, in the 
network-oblivious case, the structure of the optimal stopping rule is the same as that without the 
network, and the optimal change detection delay decouples into the network delay and the optimal 
^ i decision delay without the network. In the network-aware case, the optimal stopping problem is 

' analysed as a partially observable Markov decision process, in which the states of the queues and 

delays in the network need to be maintained. A sufficient statistic for decision is found to be the 
, network-state and the posterior probability of change having occurred given the measurements 

received and the state of the network. The optimal regimes are studied using simulation. 

Categories and Subject Descriptors: C.2.3 [Computer-Communication Networks]: Net- 
' work Operations — A^etworA; moraitormg; 1.2.8 [Artificial Intelligence] : Problem Solving, Control 

Methods, and Search — Control theory 

General Terms: Algorithms, Design, Performance 

Additional Key Words and Phrases: Optimal change detection over a network, detection delay, 
cross-layer design of change detection 
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Fig. 1. An ad hoc wireless sensor network with a fusion center is shown. The small circles are 
the sensor nodes ("motes"), and the lines between them indicate wireless links obtained after a 
self-organization procedure. 

1. INTRODUCTION 

A wireless sensor network is formed by tiny, untethered devices ( "motes" ) that can 
sense, compute and communicate. Sensor networks have a wide range of apph- 
cations such as environment monitoring, detecting events, identifying locations of 
survivors in building and train disasters, and intrusion detection for defense and 
security applications. For factory and building automation applications, there is in- 
creasing interest in replacing wireline sensor networks with wireless sensor networks, 
due to the potential reduction in costs of engineering, installation, operations, and 
maintenance [Honeywell ln(\ [ISA]. 

Event detection is an important task in many sensor network applications. In 
general, an event is associated with a change in the distribution of a related quan- 
tity that can be sensed. For example, the event of a gas leakage at any joints in a 
pipe causes a change in the distribution of pressure at the joint and hence can be 
detected with the help of pressure sensors. In this paper, we limit our discussion 
to the centralized fusion model (see Figure 1), in which each mote, in an event de- 
tection network, senses and sends some function of its observations (e.g., quantized 
samples) to the fusion center at a particular rate. The fusion center, by appropri- 
ately processing the sequence of values it receives, makes a decision regarding the 
state of nature, i.e., it decides whether a change has occurred or not. 

Our problem is that of minimizing the mean detection delay (the delay between 
the event occurring and the detection decision at the fusion center) with a bound 
on the probability of false alarm. We consider a small extent network in which all 
the sensors have the same coverage, i.e., when the change in distribution occurs it 
is observed by all the sensors and the statistics of the observations are the same at 
all the sensors. N sensors synchronously sample their environment at a particular 
rate. Synchronized operation across sensors is practically possible in networks such 
as 802.11 WLANs and Zigbee networks since the access point and the PAN coordi- 
nator, respectively, transmit beacons that provide all nodes with a time reference. 
Based on the measurement samples, the nodes send certain values (e.g., quantized 
samples) to the fusion center. Each value is carried by a packet, which is transmit- 
ted using a contention-based multiple access mechanism. Thus, our small extent 
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Fig. 2. The sensors take samples periodically at instants ti,t2,---, and prepare to send to the 
fusion center a vector of values = [^^j^^^ , ^^f^^ > ■ ■ ■ i ^i^^ j ■ Each is queued as a packet in 

the queue of the respective node. Due to multiple access delays, the packets arrive with random 
delays at the fusion center; for example, for X2, the delays , for the packets from 

sensors 1, 2 and 3, are shown. 



network problem is a natural extension of the standard change detection problem 
(see [Veeravalli 2001] and the references therein) to detection over a random access 
network. The problem of quickest event detection problem in a large extent network 
(where the region of interest is much larger than the sensing coverage of any sensor) 
is considered by us in [Premkumar et al. 2009]. Also, a small extent network can 
be thought of as a cluster in a large extent network and that the decision maker 
can be thought of as a cluster head. 

In this setting, due to the multiple access network delays between the sensor 
nodes and the fusion center, several possibilities arise. In Figure 2 we show that 
although the sensors take samples synchronously, due to random access delays the 
various packets sent by the sensors arrive at the fusion center asynchronously. As 
shown in the figure, the packets generated due to the samples taken at time t2 
arrive at the fusion center with a delay of D^'' , d!^^ , D^^ , etc. It can even happen 
that a packet corresponding to the samples taken at time ^3 can arrive before one 
of the packets generated due to the samples taken at time t2- 

Figure 3 depicts a general qucucing and decision making architecture in the fusion 
center. All samples are queued in per-node queues in a sequencer. The way the 
sequencer releases the packets gives rise to the following three cases, the first two 
of which we study in this paper. 

(1) The sequencer queues the samples until all the samples of a "batch" (a batch 
is the set of samples generated at a sampling instant) are accumulated; it then 
releases the entire batch to the decision device. The batches arrive to the 
decision maker in a time sequence; ordcir. The decision maker processes the 
batches without knowledge of the state of the network (i.e., reception times 
at the fusion center, and the numbers of packets in the various queues). We 
call this. Network Oblivious Decision Making (NODM). In factory and building 
automation scenarios, there is a major impetus to replace wireline networks 
between sensor nodes and controllers. In such applications, the first step could 
be to retain the fusion algorithm in the controller, while replacing the wireline 
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Fig. 3. A conceptual block diagram of the wireless sensor network shown in Figure 1. The fusion 
center has a sequencing buffer which queues out-of-sequence samples and delivers the samples to 
the decision maker in time-order, as early as possible, batch— wise or sample— wise. 

network with a wireless ad hoc network. Indeed, we show that this approach 
is optimal for NODM, provided the sampling rate is appropriately optimized. 

(2) The sequencer releases samples only in time-sequence order but does not wait 
for an entire batch to accumulate. The decision maker processes samples as 
they arrive. We call this, Network Aware Decision Making (NADM). In NADM, 
whenever the decision makcir rciccnvcis a sample!, it has to roll back its decision 
statistic to the sampling instant, update the decision statistic with the received 
sample and then update the decision statistic to the current time slot. The 
decision maker makes a Bayesian update on the decision statistic even if it 
does not receive a sample in a slot. Thus, NADM requires a modification in the 
decision making algorithm in the fusion center. 

(3) The sequencer does not queue any samples. The fusion center acts on the values 

from the various sampling instants as they arrive, possibly out of order. The 
formulation of such a problem would be an interesting topic for future research. 

Our Contributions: We find that, in the existing literature on sequential change 
detection problems (see discussion on related literature below), it has been assumed 
that, at a sampling instant, the samples from all the sensors reach the fusion center 
instantaneously. As explained above, however, in our problem the delay in detection 
is not only due to the detection procedure requiring a certain amount of samples to 
make a decision (which we call decision delay), but also due to the random packet 
delay in the multiple access network (which we call network delay). We work with 
a formulation that accounts for both these delays, while limiting ourselves to the 
particular fusion center behaviours explained in cases (1) and (2) above. 

In Section 2, we discuss the basic change detection problem and setup the model. 
In Section 3, we formulate the change detection problem over a random access 
network in a way that naturally includes the network delay. We show that in the 
case of NODM, the problem objective decouples into a part involving the network 
delay and a part involving the optimal decision delay, under the condition that 
the sampling instants are periodic. Then, in Section 4, we consider the special 
case of a network with a star topology, i.e., all nodes are one hop away from the 
fusion center and provide a model for contention in the random access network. 
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In Section 5, we formulate the NADM problem where we process the samples as 
they arrive at the fusion center, but in a time causal manner. The out-of-sequence 
packets are queued in a sequencing biifFcr and arc released to the decision maker as 
early as possible. We show in the NADM case that the change-detection problem 
can be modeled as a Partially Observable Markov Decision Process (POMDP). 
We show that a sufficient statistic for the observations include the network-state 
(which include the queue lengths of the sequencing buffer, network-delays) and the 
posterior probability of change having occurred given the measurements received 
and the network states. As usual, the optimal policy can be characterised via a 
Bellman equation, which can then be used to derive insights into the structure of the 
policy. We show that the optimal policy is a threshold on the posterior probability 
of change and that the threshold, in general, depends on the network state. Finally, 
in Section 6 we compare, numerically, the mean detection delay performance of 
NODM and a simple heuristic algorithm motivated by NADM processing. We show 
the tradeoff between the sampling rate r and the mean detection delay. Also, we 
show the tradeoff between the number of sensors and the mean detection delay. 
Related Literature: The basic mathematical formulation in this paper is an 
extension of the classical problem of sequential change detection in a Bayesian 
framework. The centralized version of this problem was solved by Shiryaev (see 
[Shiryaev 1978]). The decentralized version of the problem was introduced by Tenny 
and Sandell [Tenny and Sandell 1981]. In the decentralized setting, Vecravalli 
[Veeravalli 2001] provided optimal decision rules for the sensors and the fusion 
center, in the context of conditionally independent sensor observations and a quasi- 
classical information structure. For a large network setting, Niu and Varshney [Niu 
and Varshney 2005] studied a simple hypothesis testing problem and proposed a 
counting rule based on the number of alarms. They showed that, for a sufficiently 
large number of sensors, the detection performance of the counting rule is close 
to that of the optimal rule. In a recent article on anomaly detection in wireless 
sensor networks [Rajasegarar et al. 2008], Rajasegarar et al. have provided a survey 
of statistical and machine learning based techniques for detecting various types of 
anomalies such as sensor faults, security attacks, and intrusions. In [Aldosari and 
Moura 2004] the authors consider the problem of decentralized binary hypothesis 
testing, where the sensors quantize the observations and the fusion center makes a 
binary decision between the two hypotheses. 

Remark: In the existing literature on the topic of optimal sequential event 
detection in wireless sensor networks, to the best of our knowledge there has been no 
prior formulation that incorporates multiple access delay between the sensing nodes 
and the fusion center. Interestingly, in this paper we introduce, what can be called 
a cross layer formulation involving sequential decision theory and random access 
network delays. In particular, we encounter the fork-join queueing model (see, 
for example, [Baccelli and Makowski 1990]) that arises in distributed computing 
literature. 

2. THE BASIC CHANGE DETECTION PROBLEM 

In this section, we introduce the model for the basic change detection problem. The 
notation, we follow, is given here. 
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Fig. 4. Time evolves over slots. The length of a slot is assumed to be unity. Thus, slot k represents 
the interval [k, fc + 1) and the beginning of slot k represents the time instant k. Samples are taken 
periodically every 1/r slots, starting from ti = 1/r. 
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Fig. 5. Change time and the detection instants with and without network delay are shown. The 
coarse sampling delay is given by — T where Ik is the first sampling instant after change, and 
the network delay is given by Uf. —tf^. 



Time is slotted and the slots are indexed by k = 0,1,2 We assume that the 

length of a slot is unity and that slot k refers to the interval [k, k + 1). Thus, the 
beginning of slot k indicates the time instant k (see Figure 4). 

N sensors are synchronously sampling at the rate r samples/slot, i.e., the sensors 
make an observation every 1/r slots and send their observations to the fusion 
center. Thus, for example, if r = 0.1, then a sample is taken by a sensor every 
10*'' slot. We assume that 1/r is an integer. The sampling instants are denoted 
t\,t'2,--- (see Figure 5). Define Iq = 0; note that the first sample is taken at 
ti = 1/r. 

The vector of network delays of the batch h is denoted by 



D, = 



n(i) n(2) ... nW 

^^b ^ ^^b 



where D^^ e {1, 2, 3, ■ • ■ } is the network delay in slots, of the ith component of 

the &th batch (sampled at ti, = h/r). Also, note that d\^'^ ^ 1, as it requires one 
time slot for the transmission of a packet to the fusion center after a successful 
contention. 

• The state of nature O G {0, 1}. represents the state "before change" and 
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1 represents the state "after change". It is assumed that the change time T 
(measured in slots), is geometrically distributed i.e., 

P (T = 0) = p 

and, forfc^l, P (T = fc | T > 0) = p(l -p)^*^-^^ (1) 

The value of for T accounts for the possibility that the change took place before 
the observations were made. 



The vector of outputs from the sensor devices at the 6th batch is denoted by 



3^(1) ^{2) y{N) 



where X^*^ G X is the 6th output at the ith sensor. Given the state of nature, 
X^'^ are assumed to be (conditionally) independent across sensors and i.i.d. over 
sampling instants with probability distributions Fq{x) and Fi{x) before and after 
the change respectively. Xi corresponds to the first sample taken. In this work, 
we do not consider the problem of optimal processing of the sensor measurements 
to yield the sensor outputs, e.g., optimal quantizers (see [Veeravalli 2001]). 

Let Sb, b ^ 1, he the state of nature at the 6th sampling instant and Sq the state 
at time 0. Then Sj, G {0, 1}, with 

P (5o = 1) = p = 1 - P (5*0 = 0) 

Sb evolves as follows. U Sb = for 6^0, then 



Sb+i = 



1 W.p. Pr 
W.p. (1 — Pr) 



where = 1 - (1 - p)i/''. Further, if ^5 = 1, then Sb+i = 1. Thus, if 5*0 = 0, 
then there is a change from to 1 at the Kth sampling instant, where K is 
geometrically distributed. For 6^1, 



P{K = b)=Pr{l-p, 



\b-l 



Each value to be sent to the fusion center by a node is inserted into a packet 
which is queued for transmission. It is then transmitted to the fusion center by 
using a contention based multiple access protocol. A node can directly transmit 
its observation to the fusion center or route it through other nodes in the system. 
Each packet takes a time slot to transmit. The MAC protocol and the queues 
evolve over the same time slots. The fusion center makes a decision about the 
change depending on whether Network Oblivious (NODM) processing or Network 
Aware (NADM) processing is employed at the fusion center. In the case of NODM 
processing, the decision sequence (also called as action sequence), is A„, u > 0, with 
Au G {stop and declare change(l), take another sample(O)}, where « is a time in- 
stant at which a complete batch of N samples corresponding to a sampling instant is 
received by the fusion center. In the case of NADM processing, the decision sequence 
is Ak, fc ^ 0, with Ak G {stop and declare change(l), take another sample(O)}, i.e., 
a decision about the change is taken at the beginning of every slot. 
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Fig. 6. A sensor network model of Figure 3 with one hop communication between the sensor 
nodes and the fusion center. The random access network along with the sequencer is a fork— join 
queueing model. 



3. NETWORK OBLIVIOUS DECISION MAKING (NODM) 

From Figure 2, wc note that although ah the components of a batch b are generated 
at tj, = b/r, they reach the fusion center at times tb + D^\i = 1,2, - ■ ■ ,N. In 
NODM processing, the samples, which are successfully transmitted, are queued 
in a sequencing buffer as they arrive (see Figure 6) and the sequencer releases a 
(complete) batch to the decision maker, as soon as all the components of a batch 
arrive. The decision maker makes a decision about the change at the time instants 
when a (complete) batch arrives at the fusion center. In the Network Oblivious 
(NODM) processing, the decision maker is oblivious to the network and processes 
the batch as though it has just been generated (i.e., as if there is no network, hence 
the name Network Oblivious Decision Making). We further define (see Figure 5) 



• Ub, {b ^ 1): the random instant at which the fusion center receives the complete 
batch X(, 

• K € {0,1, . . .}: the batch index at which the decision takes place, if there was 
no network delay. K = means that the decision 1 (stop and declare change) is 
taken before any batch is generated 

• T = tj^: the random time (a sampling instant) at which the fusion center stops 

and declares change, if there was no network delay 

• U = Uj^: the random time slot at which the fusion center stops and declares 
change, in the presence of network delay 

• Dfy = Ub — tb: Sojourn time of the bth batch, i.e., the time taken for all the 
samples of the bth batch to reach the fusion center. Note that Db is given by 
max{£)^'' : i = 1,2,- •• ,N}. Thus, the delay of the batch K at which the 
detector declares a change is Uf^ — t^ = U — T 

We define the following detection metrics. 

Mean Detection Delay defined as the expected number of slots between the 
change point T and the stopping time instant U in the presence of coarse sampling 
and network delays, i.e.. Mean Detection Delay = E {u — T^ l^f^j,^ 
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Mean Decision Delay defined as the expected number of slots between the change 
point T and the stopping time instant T in the (presence of coarse sampling delay 
and in the) absence of network delay, i.e., Mean Decision Delay = E — 1{t>t} 

With the above model and assumptions, we pose the following NODM problem: 
Minimize the mean detection delay with a bound on the probability of false alarm, 
the decision epochs being the time instants when a complete batch of components 
corresponding to a sampling instant is received by the fusion center. In Section 5, 
we pose the problem of making a decision at every slot based on the samples as 
they arrive at the fusion center. Motivated by the approach in [Veeravalli 2001] we 
use the following formulation for a given sampling rate r 

minE {U-T)l^f^r^y (2) 

such that p(f <T^ < a 

where a is the constraint on the false alarm probability. 



Fig. 7. Illustration of an event of false alarm with T < T, but U > T 



Remark 3.1 Note that if a ^ 1 — then the decision making procedure can be 
stopped and an alarm can be raised even before the first observation. Thus, we 
assume that a < 1 — p. 

Remetrk 3.2 Note that here we consider as the probability of false 

alarm and not P(U <T^, i.e., a case as shown in Figure 7 is considered a false 

alarm. This can be understood as follows: when the decision unit detects a change 
at slot U, the measurements that triggered this inference would be carrying the 
"time stamp" T, and we infer that the change actually occurred at or before T. 
Thus if T < T, it is an error. 

We write the problem defined in Eqn. 2 as 



min E 

Ho 



{U-T)l^f^^y (3) 



where Ha is the set of detection policies for which P < 



^ a. 



Theorem 1 // the sampling is periodic at rate r and the batch sojourn time process 
Di,, b ^ 1, is stationary with mean d{r), then 

+ 



min E 

n„ 



(f/-T)l^^^J = (rf(r) + Z(r))(l-a)-p./(r) + iniinE 



K-K 



r 

where l{r) is the delay due to (coarse) sampling. 
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Remark 3.3 For example in Figure 5, the delay due to coarse sampling ist2—T, 
K — K = 3 — 2 = 1, and the network delay is U^—t^. The stationarity assumption 

on Djj, b ^ 1, is justifiable in a network in which measurements are continuously 
made, but the detection process is started only at certain times, as needed. 



I) 



Proof: The following is a sketch of the proof (the details are in the Appendix 



min E 

n„ 



mm 

n„ 



+ E 



T-T 



min<jE[2?] (l-P(T<r)) 



T-T 



where we have used the fact that under periodic sampling, the queueing system 
evolution and the evolution of the statistical decision problem are independent, 
i.e., K is independent of {Di, D2, ■ ■ ■} and E[D] is the mean stationary queueing 
delay (of a batch). By writing E[D] ~ d{r) and using the fact that the problem 



minn„ E 
becomes 



T-T 



is solved by a policy tt* S Ha with = a, the problem 



d{r){l — a) + inin E 



T-T 



{d{r) + l{r)){l - a) - p ■ l{r) 



— min E 



K - K 



where l{r) is the delay due to sampling. Notice that minn„ E 
change detection problem at the sampling instants. □ 



K-K 



is the basic 



Remark 3.4 It is important to note that the independence between K and {Di, D^, ■ 
arises from periodic sampling. Actually this is conditional independence given the 
rate of the periodic sampling process. If, in general, one considers a model in which 
the sampling is at random times (e.g., the sampling process randomly alternates 
between periodic sampling at two different rates or if adaptive sampling is used) 
then we can view it as a time varying sampling rate and the asserted independence 
will not hold. 

We conclude that the problem defined in Eqn. 2 effectively decouples into the sum 
of the optimal delay in the original optimal detection problem, i.e., ^ minn„ E K — K 

as in [Veeravalli 2001], a part that captures the network delay, i.e., d{r){l — a), and 
a part that captures the sampling delay, i.e., Z(r)(l — a) — pl{r). 

4. NETWORK DELAY MODEL 

From Theorem 1, it is clear that in NODM processing, the optimal decision device 
and the queueing system are decoupled. Thus, one can employ an optimal sequential 
change detection procedure (see [Shiryaev 1978]) for any random access network 
(in between the sensor nodes and the fusion center). Also, NODM is oblivious to 
the random access network (in between the sensor nodes and the fusion center) 
and processes a received batch as though it has just been generated. In the case 
of NADM (which we describe in Section 5), the decision maker processes samples, 
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Fig. 8. A sensor network with a star topology with the fusion center at the hub. The sensor nodes 
use a random access MAC to send their packets to the fusion center. 

keeping network-delays into account, thus requiring the knowledge of the network 
dynamics. In this section, we provide a simple model for the random access network, 
that facilitates the analysis and optimisation of NADM. 

N sensors form a star topology^ (see Figure 8) ad hoc wireless sensor network 
with the fusion center as the hub. They synchronously sample their environment 
at the rate of r samples per slot periodically. At sampling instant th = b/r, sensor 
node i generates a packet containing the sample value xj^^^ (or some quantized 
version of it). This packet is then queued first-in-first-out in the buffer behind the 
radio link. It is as if each sample is a fork operation that puts a packet into each 
sensor queue (see Figure 6). 

The sensor nodes contend for access to the radio channel, and transmit packets 
when they succeed. The service is modeled as follows. As long as there are packets 
in any of the queues, successes are assumed to occur at the constant rate of cr (0 < 
a <1) per slot, with the intervals between the successes being i.i.d., geometrically 
distributed random variables, with mean l/cr. If, at the time a success occurs, there 
are n nodes contending (i.e., n queues are nonempty) then the success is ascribed 
to any one of the n nodes with equal probability. 

The N packets corresponding to a sample arrive at random times at the fusion 
center. If the fusion center needs to accumulate all the components of each 
sample then it must wait for that component of every sample that is the last to 
depart its mote. This is a join operation (see Figure 6). 

It is easily recognized that our service model, in the case of NODM is the discrete 
time equivalent of generalized processor sharing (GPS - see, for example, [Kumar 
et al. 2004]), which can be called the FJQ-GPS (fork-join queue (see [Baccelli and 
Makowski 1990]) with GPS service). In the case of NADM, the service model is just 
the GPS. 

In IEEE 802.11 networks and IEEE 802.15.4 networks, if appropriate parameters 
are used, then the adaptive backoff mechanism can achieve a throughput that is 
roughly constant over a wide range of n, the number of contending nodes. This 



^Note that Theorem 1 is more general and does not assume a star topology. 

ACM Transactions on Sensor Networks, Vol. V, No. N, Month 20YY. 



12 



r\vsn for different transmission rates (C) in RTS-CTS 



X 6 



^ 4 





— Analysis 




-» ' Simulation 


.-.-..-n . .£=11 Mbps 




^ - - ----v-o.^«. 








• ■•■U--U--U-.U-.( 


. C=2.2MbDS 







5 10 15 

Number of nodes (n) — > 



20 



Fig. 9. The aggregate saturation throughput rj of an IEEE 802.11 network plotted against the 
number of nodes in the network, for various physical layer bit rates: 2.2 Mbps, 5.5 Mbps, and 
11 Mbps . The two curves in each plot correspond to an analysis and an NS-2 simulation. 
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Fig. 10. The aggregate saturation throughput rj of an IEEE 802.15.4 star topology network plotted 
against the number of nodes in the network. Throughput obtained with default backoff parameters 
is shown on the left and that obtained with backoff multiplier = 3, is shown on the right. The 
two curves in each plot correspond to an analysis and an NS-2 simulation. 



is well known for the CSMA/CA implementation in IEEE 802.11 wireless LANs; 
see, for example, Figure 9 [Kumar et al. 2008]. For each physical layer rate, the 
network service rate remains fairly constant with increasing number of nodes. From 
Figure 10 (taken from [Singh et al. 2008]) it can be seen that with the default backoff 
parameters, the saturation throughput of a star topology IEEE 802.15.4 network 
decreases with the number of nodes N, but with the backoff multiplier = 3, the 
throughput remains almost constant from iV = 10 to iV = 50 [Singh et al. 2008]; 
thus, in the latter case our GPS model can be applicable. 

Theorem 2 The stationary delay D is a proper random variable with finite mean 
if and only if Nr < a. 

Proof: See Appendix - II. □ 

Thus, for the FJQ-GPS queueing system to be stable, the sampling rate r is chosen 
such that r < jf. 
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5. NETWORK AWARE DECISION MAKING (NADM) 

In Section 3, we formulated the problem of NODM quickest change detection over 
a random access network, and showed that (when the decision instants are Uk, as 
shown in Figure 5) the optimal decision maker is independent of the random access 
network, under periodic sampling. Hence, the Shiryaev procedure, which is shown 
to be delay optimal in the classical change detection problem (see [Shiryaev 1978]), 
can be employed in the decision device independently of the random access network. 
It is to be noted that the decision maker in the NODM case, waits for a complete 
batch of N samples to arrive, to make a decision about the change. Thus, the mean 
detection delay of the NODM has a network delay component corresponding to a 
batch of N samples. In this section, we provide an alternative mechanism of fusion 
at the decision device called Network Aware Decision Making (NADM), in which 
the fusion algorithm does not wait for an entire batch to arrive, and processes the 
samples as soon as they arrive, but in a time-causal manner. 

We now describe the processing in NADM. Whenever a node (successfully) trans- 
mits a sample across the random access network, it is delivered to the decision 
maker if the decision maker has received all the samples from all the batches gen- 
erated earlier. Otherwise, the sample is an out-of-scquencc sample, and is queued 
in the sequencing buffer. It follows that, whenever the (successfully) transmitted 
sample is the last component of the batch that the decision maker is looking for, 
then the head of line (HOL) components, if any, in the queues of the sequencing 
buffer are also delivered to the decision maker. This is because, these HOL samples 
belong to the next batch that the decision maker should process. The decision 
maker makes a decision about the change at the beginning of every time slot, irre- 
spective of whether it receives a sample or not. In NADM, whenever the decision 
maker receives a sample, it takes into account the network delay of the sample 
while computing the decision statistic. The network-delay is a part of the state of 
the queueing system which is available to the decision maker. Thus, unlike NODM, 
the state of the queueing system also plays a role in decision making. 

In Section 5.1, we define the state of the queueing system. In Section 5.2, we 
define the dynamical system whose change of state (from to 1) is the subject of 
interest to us. We define the state of the dynamical system as a tuple that contains 
the queueing state, the state of nature, and a delayed state of nature. The delayed 
state of nature is included in the state of the system so that the (delayed) sensor 
observations that the decision maker receives at time instant fc + 1 depend only on 
the state, the control, and the noise of the system at time instant fc, a property 
which is desirable to define a sufficient statistic (see page 244, [Bertsekas 2000a]). 
We explain the evolution of the state of the dynamical system in Section 5.3. In 
Section 5.4, we formulate the NADM change detection problem and we find a suf- 
ficient statistic for the observations in Section 5.5. In Section 5.6, we provide the 
optimal decision rule for the NADM change detection problem. 

5.1 Notation and State of the Queueing System 

Recall the notation introduced in Section 2. Time progresses in slots, indexed by 
= 0, 1, 2 ■ • • ; the beginning of slot fc is the time instant fc. Also, the time instant 
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k+1 



Slots 



Fig. 11. At time k, the decision maker expects samples (or processes samples) from batch B)^. 
Also, at time fc, Afe is the number of slots to go for the next sampling instant and is the number 
of slots back at which batch Bfe is generated. 
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_L 



_L 



_J_ 



_L 



_L 



_J_ 



_L 



k-1 k k+1 
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At=0 
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Fig. 12. Illustration of a scenario in which Afe = 0. If the last component from batch Bfe—i is 
received at k, and if there is no sampling instant between tB,,_i and k, then Afe = 0. Also, note in 
this case that Afe = Afe_|_i = • • • = Aj^^ = 0. In this scenario, at time instants fe, fe + 1, • • • , tB^. , 
all the queues at the sensor nodes and at the sequencer axe empty, and at time instant tBf.+, all 
sensor node queues have one pa<;ket which is generated at tB),- 

just after the beginning of time slot is denoted by k+ ^. Recall that the nodes 

take samples at the instants 1/r, 2/r, 3/r, ■ • ■ . We define the state of the queueing 
system here. Note that the queueing system evolves over slots. 

• Xk G {1,2, - ■ ■ , 1/r} denotes the number of time slots to go for the next sampling 
instant, at the beginning of time slot k (see Figure 11). Thus, 



Afe := 

r 



k 



mod - 

r 



(4) 



Thus, Ao = ^, Ai = i — 1, • • • , and at the samphng instants tb, Xt,, = ^■ 
Bk G {1,2,3, •••} denotes the index of the batch that is expected to be or is 
being processed by the decision maker at the beginning of time slot k. Note 
Bq = Bi = ■■ ■ = Bxjj. = 1. Also, note that the batch Bk is generated at time 
instant Bfe/r. 

Afc G {0, 1, 2, ■ • ■ } denotes the delay in number of time slots between the time 
instants k and B^/r (see Figure 11). 



max 



k-^,Q 



(5) 



^Note that the notation t+ denotes a time embedded to the right of t and is different from the 
notation Recall that {x)~^ :=max{x,0}. 
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A new sample is taken here if A^+i = 1/r 

J / 



T (0 7- («) T _|_ 1 



k ( k+1 slots 



Departure w.p. if iv^'"' > 

(j\ 

Fig. 13. The evolution of Lj_ from time slot k to time slot k+1. If during time slot k, node i 
transmits (successfully) a packet to the fusion center (i.e., = i), then that packet is flushed out 
of its queue at the end of time slot k. Also, a new sample is generated (every 1/r slots) exactly 
at the beginning of a time slot. Thus, Lf_^-^, the queue length of sensor node i just after the 
beginning of time slot + 1 (i.e., at (fc + 1)+) is given by L^'|^ = L^'' - + 

Note that the batches of samples taken after Bk/r and up to (inchiding) k are 
queued either in the sensor node queues or in the sequencing buffer in the fusion 
center. If at time k, the fusion center receives a sample which is the last sample 
from batch Bk-i, then Bk = Bfe-i + 1. If the sampling instant of the Bkth batch 
is later than k (i.e., B^/r > k), then A). = (up to time B^/r at which instant, 
a new batch is generated). This corresponds to the case, when all the samples 
generated up to time slot fc, have already been processed by the decision maker 
(see Figure 12). In particular, Aq = Ai = • • • = Ai_i = 0. 

• L^*^ e {0, 1, 2, • • • } denotes the queue length of the ith sensor node just after 
the beginning of time slot k (i.e., at time instant k+). The vector of queue 
lengths is Lfe = L^^\ • ■ • , Let Nk G {0, 1, 2, • • • , N} be the number 

of non-empty queues at the sensor nodes, just after the beginning of time slot k. 

N 



i.e., the number of sensor nodes that contend for slot k is N^- Hence, using the 
network model we have provided in Section 4, the evolution of L^^^ (see Figure 13) 
is given by the following: 



L« = 



^k+1 - < 



+ 1{A.+,=1A} w.p. (1 - a) if Nk > 0, 

I max{L« -1,0} + l{A.+.=i/r} w.p. ^ if TV^ > 0. 



Note that when all the samples generated up to time slot k have already been 
processed by the decision maker and k is not a sampling instant, i.e., = and 
A/c ^ 1/r, then L/c = (as there are no outstanding samples in the system). For 
e.g., Li = L2 = ■ • ■ = ^\/r-i = 0. Also, note that just after sampling instant th, 
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A new sample is received by the fusion center here if = j 



fc+1 





slots 



if > 



Fig. 14. The evolution of Wj^ from time slot k to time slot A; + 1. If a sample from node i is 
transmitted (successfully) during time slot k (i.e., M^. = i), then it is received by the fusion center 
at the end of time slot k (i.e. at {k + 1)—). If this sample is from batch B^, it is passed on to the 
decision maker. Otherwise, it is queued in the sequencing buffer, in which case ~ ^' 

On the other hand, if a sample from some other node j is transmitted (successfully) during time 
slot k (i.e., Mj, = j ^ i), and if this sample is the last component to be received from batch Sj. 
by the fusion center, then the HOL packet of the ith sequencing queue, if any, is also delivered to 
the decision maker. Thus, in this case, Wl^l^ = max{VF^'^ - 1,0}. Note that wl^l^ refers to the 
queue length corresponding to node i at the sequencer, at the beginning of time slot k + 1. 



• Mfe G {0, 1, 2, • • • , iV} denotes the index of the node that successfully transmits 
in slot k. Mk = means that there is no successful transmission in slot k. Thus, 
from the network model we have provided in Section 4, we note that 

( w.p. 1 if A^fe = 

Mfc = J w.p. (1 - a) if TVfe > 

[jw.p. ^ if4^) >0, j = l,2,... ,iV 

• e {0, 1, 2, • ■ • } denotes the queue length of the ith sequencing buffer at time 

k. The vector of queue lengths is given by = ■ ■ ■ , VF^^'']- Note 

that Wfe = if Afe = 0, i.e., the sequencing buffer is empty if there are no 
outstanding samples in the system. In particular, Wq = Wi = • • • = Wi = 0. 

The evolution of Wl. is explained in Figure 14. If a sample from node i of a batch 

later than is successfully transmitted during slot fc, then Wj._^-y = W^. +1. 
If a sample from node j of batch Bk is successfully transmitted and if it is the 

last sample to be received from batch Bk, then the queue lengths of sequencing 
buffer are decremented by 1, i.e., w'^^^ = maxjPF^*'' — 1,0}. 

• R'f' e {0,1} denotes whether the sample has been received and processed 
by the decision maker at time k. i?^*^ = means that the sample has not 
yet been received by the decision maker and i?^ — 1 means that the sample X^^'^ 
has been received and processed by the decision maker. The vector of R^^s is 
given by = i?f \ ■ • ■ ,r\P]. Note that, if = 0, w]:^^ = 0, i.e., the 
ith sequencing buffer is empty if the sample expected by the decision maker has 
not yet been transmitted. Also note that when A/- = 0, R/j = 0, as the samples 
from the current batch B^ have yet to be generated or have just been generated. 

We now relate the queue lengths L^^ W^^ . Note that at the beginning of 

time slot k, ^y^J batches have been generated so far, of which B^ — l batches are 
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completely received by the decision maker. In batch B/-, ith sample is received by 



the decision maker if R^^^ = 1. Hence, at time k, Bk — 1 + R^' samples generated 
by node i have been processed by the decision maker and the remaining samples 
are in the sensor and sequencing queues. Thus, we have 



(i) 



1/r 

k- 



B,/r + 1/r 



(6) 



1/r 



R 



1/r 



-R 



+i-i?: 



(i) 



if A; > Bk/r 

if fc = Bk/r 
if fc < Bk/r 



Recalling the definition of A^, we write the above Eqn. as, 



-(i) 



1/r 



+ 1 - 



if Afc > 

if Afe = 0,Afe = 1/r 
if Afc = 0, Afc ^ 1/r. 



(7) 



Note that in the above Eqn. A^ = 0, Afe = 1/r (or equivalently k = Bk/r), 
corresponds to the case when the samples of batch Bk have just been taken and 

all the samples from all previous batches have been processed. Thus, in this case 



L^'^ = 1 (as Wj.'' = 0). In the case of Afe = 0, Afe ^ 1/r (or equivalently k < Bk/r), 
all the samples from all previous batches have been processed and a new sample 



.(0 



from batch Bk is not taken yet. Thus, in this case L^^'' = (and W^'' = 0). Hence 
given Qfe = [Afe, Sfe, Afe, Wfe, Rfe], the queue lengths ife^s can be computed as 



r{i) 



Lf = </'L«(Qfe) : 
Also, Nk = ^wiQk) ■■ 



1/r 



N 



+ 1-R 



r,(i)(Qfe)>0}- 



if Afc > 

if Afe = 0, Afe = 1/r -(8) 
if Afe = 0, Afe ^ 1/r 

(9) 



i=l 



Thus, the state of the queueing system at time k, can be expressed as Qfe = 
[Afc,Sfe, Afc,Wfe,Rfc]. Note that the decision maker can observe the state Qfc per- 
fectly. The evolution of the queueing system is explained in the next subsection. 

5.2 Evolution of the Queueing System 

The evolution of the queueing system from time k to time A; + 1 depends only 
on Mfc, the success/no-success of contention on the random access channel. Note 
that the evolution of Afc is deterministic and that of Afe depends on Bk- Hence, 
to describe the evolution of Qfe, it is enough to explain the evolution of Bk,'Wk, 
and Rfe for various cases of Mfc. Let Yfe+i e {0} U (u^^^ .Y") denote the vector of 
samples received, if any, by the decision maker at the beginning of slot k + 1 (i.e., 
the decision maker can receive a vector of n samples where n ranges from to N). 
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At the beginning of time slot fc + 1, the following possibilities arise: 

No successful transmission: This corresponds to the case i) when all the 
queues are empty at the sensor nodes (Nk = 0), or ii) when some queues are 

non-empty at the sensor nodes {Nk > 0), either no queue attempts, or there is 
more than one attempt (resulting in a collision). In either case, = and the 
decision maker does not receive any sample, i.e., Yfc_|_i = 0. In this case, it is 

clear that Bk+i = Bk, Wfe+i = Wfe, and Rfe+i = Rfe. 

Successful transmission of node j's sample from a later batch: This 
corresponds to the case, when the decision maker has already received the jth 
component of the current batch Bk (i.e., R^^ = 1) and that it has not received 

some sample, say i ^ j, from the batch Bk (i.e., i?^'"* = 0, for some i). The 
received sample (is an out-of-sequence sample and) is queued in the sequencing 
buffer {W^2i = + Thus, in this case, Mfc = j and the decision maker does 
not receive any sample, i.e., Y^+i = 0. In this case, it is clear that Bfe+i = Bk, 
Wfe+i = Wfe + Bj, and Rfc+i = Rfc. 

Successful transmission of node j's current sample which is not the 
last component of the batch Bk'. This corresponds to the case when the 
decision maker has not received the jth component of the batch Bk before time 
slot k {Rk ^ = 0), and that it has received all the samples that are generated 
earlier than that of the successful sample. Also, the fusion center is yet to receive 
some other component of batch Bk (i.e., '}2iLi < N — 1). Thus, in this case, 
= j and the decision maker receives the sample Y^+i = Xq\ In this case, 
it is clear that Bk+i = Bk, Wfc+i = Wfc, and Rfc+i = Rfc + e^. 
Successful transmission of node j's current sample which is the last 
component of the batch Bk'. This corresponds to the case when the decision 
maker has not received the jth component of the batch Bk before time slot k 
(Rk^ = 0), and that it has received all the samples that are generated earlier 
than that of the successful sample. Also, this sample is the last component of 
batch Bk, that is received by the fusion center, (i.e., J^iLi^k^ = N — 1). In 
this case (along with the received sample), the queues of the sequencing buffer 
deliver the head of line (HOL) components (which correspond to the batch index 
Bk + 1), if any, to the decision maker and the queues are decremented by one 
{W^^^i = maxlM^^'' — 1,0}). Thus, Mk = j and the decision maker receives the 



vector of samples Yfe+i = 



where W^^ > for 



i G {i'l, i'l, ■ ■ ■ and W^^ = for i ^ {i'l, ' ' ' ^n-i}- this case, Bk+\ = 
-Bfc + 1, Wfc+i = Wfc-ej'^ -Cj^ ej^_^, and Rfc+i = e,'^ +ei^ H l-ei;_^. 

Thus, the state of the queueing system at time + 1 can be described by 

Qfc+i = '/>Q(Qfc,-M'fc) 

:= [0A(Qfc, Mfe), (^s(Qfe, Mk), (j)AiQk, Mk), (pwiQk, Mk), 0R(Qfe, Mk)] . 

In the next subsection, we provide a model of the dynamical system whose state 
has the state of nature 9^ as one of its constituents. The quickest detection of 
change of Ofc from to 1 (at a random time T) is the focus of this paper. 
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5.3 System State Evolution Model 

Let Gfc G {0,1}, fc > 0, be the state of nature at the beginning of time slot k. 
Recall that T is the change point, i.e., for fc < T, 0^ = and for fc ^ T, 0^ = 1, 
and that the distribution of T is given in Eqn. 1. The state 6^ is observed only 
through the sensor measurements, but these are delayed. We will formulate the 
optimal NADM change detection problem as a partially observable Markov decision 
process (POMDP) with the delayed observations. The approach and the terminol- 
ogy used here is in accordance with the stochastic control framework in [Bertsekas 
2000a]. At time k, a sample, if any, that the decision maker rcicciivcs is generated 
at time Bk/r < k (i.e., samples arrive at the decision maker with a network-delay 
of Afc = fc — ^ slots). To make an inference about 9fc from the sensor measure- 
ments, wc; nnist consider the vector of states of nature that corresponds to the 
time instants k — Ak, k — Ak + 1, • • • ,k. We define the vector of states at time 
k, Sk ■— [6fc-Afc, Bfc_Afc+i, ■ • • , ©fe]- Note that the length of the vector depends 
on the network-delay Afe. When Afc > 0, @k = [Ob^ , 9b^^^, • ■ • , 9/j], and when 

Afc = 0, 0fe is just [©fe]. 

Consider the discrete-time system, which at the beginning of time slot k is de- 
scribed by the state 

T/s = [Q/s,0fe], 

where we recall that 



Qfc = 

0fe 



Afc, Bk, Ak, Wfc, Rfc 



©fe+i = < 



if Afc+i = 

if Afc+i = Afc + 1 

if Afc+i =Ak + l 



[©ft-Afc, ©fe-Afe + l, • • • , ©fc]- 

Note that Fq = [[^,1,0, 0] ,©o]- At each time slot k, we have the following set 
of controls {0, 1} where represents "take another sample", and 1 represents "stop 
and declare change". Thus, at time slot k, when the control chosen is 1, the state 
Ffc+i is given by a terminal absorbing state t; when the control chosen is 0, the 
state evolution is given by Ffc+i = [Qfc+i, 0fc+i], where 

Qfe+i = (t>Q{Qk,Mk), 

©fe + l{T=fe+l}] , 

_©fc-Afc, ©fc-Afc + 1, • • • , ©fe, ©fc + l{T=fc+l}] 5 
©fe_At, + i: ©fe-Afc + i + l> • • • , ©fe, ©fe + l{r=fc+l} 

=: {Sk,Qk,Mk,l{T=k+i}) 

where it is easy to observe that ©fc + l{T=fe+i} = ©fe+i- When Afc+i = Afc -|- 
1, the batch Bk is still in service, and hence, in addition to the current state 
0fc+i = ©fc + l{T=fe+i}, we need to keep the states ©fc-A^, ©fc-A^+i, • • ■ ,0fe- 
Also, when Afc+i = Afc + 1 — i, then the batch index is incremented, and hence, 
the vector of states that determines the distribution of the observations sampled 

at or after Bk+i/r and before A; -|- 1 is given by 0fc_A^+i, ©fc-Afc+i+n ' ' ' > ©fc > 

©fc + l{T=fc+l}] • 

Define Ok ■— l{T=fe+i}, and define Nfc := [Mk,Ok] be the state-noise during 
time slot k. The distribution of state-noise Nfc given the state of the discrete- 
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time system is given by P (M^ = rn,Ok = o\Tk = [q, 0]) and is the product 
of the distribution functions, P (Mfe = m\Tk = [q, 6]) and P (Ofe = o | Ffc = [q, 0]). 
These distribution hmctions arc provided in Appendix - III. 

The problem is to detect the change in the state Qk as early as possible by 
sequentially observing the samples at the decision maker. 

5.4 The NADM Change Detection Problem 

We now formulate the NADM change-detection problem in which the observations 

from the sensor nodes are sent over a random access network to the fusion center 
and the fusion center processes the samples in the NADM mode. 

In Section 5.3, we defined the state Tk = [Qk,®k] on which we formulate the 
NADM change detection problem as a POMDP. Recall that at the beginning of slot 
k, the decision maker receives a vector of sensor measurements and observes the 
state Qfc of the queueing system. Thus, at time k, Zfc = [Q^, Y)^] is the observation 
of the decision maker about the state of the dynamical system Tk- 

Let Ak G {0,1} be the control (or action) chosen by the decision maker after 
having observed at k. Recall that represents "take another sample" and 1 
represents the action "stop and declare change". Let 1^ = [Z[o:fe] , j4[o:fe-i]] be the 
information vector^ that is available to the decision maker, at the beginning of time 
slot k. Let T be a stopping time with respect to the sequence of random variables 
Ii, I2, • ■ • . Note that Ak = ioi k < t and Ak = 1 ioi k ^ r. We are interested in 
obtaining a stopping time r (with respect to the sequence Ii, I2, • • • ) that minimizes 
the mean detection delay subject to a constraint on the probability of false alarm. 

min E[(r-T)+] (11) 
such that P (r < T) < a 

Note that in the case of NADM, at any time k, a decision about the change is made 
based on the information 1^ (which includes the batch index we are processing and 
the delays). Thus, in the case of NADM, false alarm is defined as the event {r < T} 
and, hence, r > T is not classified as a false alarm even if it is due to pre-change 
measurements only. However, in the case of NODM, this is classified as a false alarm 
as the decision about the change is based on the batches received until time k. 

Let c be the cost per unit delay in detection. Wc are interested in obtaining a 
stopping time r* that minimizes the expected cost (Bayesian risk) given by 

C(c,r*) = mmE[l{e^=o}+c-(T-T)+] 



= min E 

r 


T-1 

l{e,=o} + c-^l{e,=i} 




k 


=0 


= min E 

r 


T — 1 

gr{Tr,Ar) + ^gk{T'k,Ak) 




k-- 


=0 


= min E 

T 


00 

y^gfc(rfc, Afc) 





Lfc=o 



3 The notation Zj^^.^.^] := Zj,^, Zfc^+i, • • • , Z^^ 
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where, as defined earfier, Fh = [Qfe, &k]- Let 6 = [0s,9s-i, ■ ■ ■ ,0i,9o]. We define 
for k < T 



5fe([q, d],a) 



0, if 6»o = 0, a = 

1, if 6'o = 0,a= 1 
c, if 9o = l,a = 
0, if 6»o = l,a = 1 



(13) 



and for k > t, gk{-, •) := 0. Recall that Ak ~ for k < t and Ak = 1 for k ^ t. 
Note that A^, the control at time slot k, depends only on 1^. Thus, every stopping 
time r, corresponds to a policy fj, = (/Uq, ;Ui, • • ■ ) such that Ak = /Ufe(I/s), with 
j4i; = for fc < T and A/; = 1 for fc > r. Thus, Eqn. 12 can be written as 



C(c, r*) = minE 



.fe=o 



= miny^ E[£(;j(rfc, Afc)] (by monotone convergence theorem) 

fe=0 

oo 

= min E[gft(rfe,/Xfe(Ift))] (14) 



fc=o 



Since 0fc is observed only through 1^, we look at a sufficient statistic for 1^ in the 
next subsection. 

5.5 Sufficient Statistic 

In Section 5.2, we have illustrated the evolution of the queueing system Q;; and we 
have shown in different scenarios, the vector Y/j received by the decision maker. 
Recall from Section 5.2 that 



■ fe+i 



r0, 

0, 



if Mfc = 0, 



ifMfe=j>0,i?^-''' = l 
if Mfe = i > 0, R^'^ =Q,±R^^ <N-l 

i=l 

if Mfe = J > 0, = 0, e4'^ = iV - 1, 

JV 



JV 



Note that yfe+i,o corresponds to . The last part of the above equation corre- 

sponds to the last pending sample of the batch Bk arriving at the decision maker 
at time k + 1, with some samples from batch Bk + 1 (= Sfe+i) also being re- 
leased by the sequencer. In this case, the state of nature at the sampling instant 
of the batch i?fe+i = + 1 is O^-Afc+i/r- Note that Q^-Ak+i/r is a component 
of the vector @k as fc — + 1/r = {Bk + l)/r < k. Thus, the distribution of 
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ife+1,0, Yk+1,1,- ■ ■ , Yk+i,n is given by 

r/o(-), ifefc_A. =0 

/n+i,oU ifGfc.A, =land 

M+i,a-J ifefe_A,+i/. = i ' 



1,2, 



Thus, at time A; + 1, the current observation Y^+i depends only on the previous 
state Ffc, previous action Ak, and the previous noise of the system Nfe. Thus, a 
sufScient statistic is [P (Pfe = [q, ^] | Ifc)][q0]g5 (see page 244, [Bertsekas 2000a]) 
where <S is the set of all states of the dynamical system defined in Sec. 5.3. Let 
q = [A, b, S, w, r]. Note that 

p{Tk = [(i,e] life) 

= P(rfe = [q,0] |lfe_i,Qfe,Yfe) 

= l{Qfc=q} • P (®fe = ^ I Ife-l>Qfc = q, Yfe) 

= l{Qfc=q} 

■P{[&k-s,Ok-s+u--- ,Qk-i,Ok] - [es,es^i,--- ,0i,0o] |lfe_i,Qfc = q,Yfc) 

— l{Qfc=q} ' P i^k-S = I I/c-1, Qfe = q, Yfe) 

• P {^k-s+j = Os-j I Ok-s+j' = Os-j',j' = 0, 1, - • ■ ,j - l,Ife_i,Qfe = q, Yfe) 

(15) 

Observe that 

P (0fe-(5+j = ds-j I Q[k-5:k-5+j-2],Qk-S+j-l = 0,Ife-l,Qfe = q, Yfe) 

1-p, if0s-j=O 
p, if 9s-j = 1 

and 

P (0fe-<5+j = Os-j I 'd[k-5:k-5+j-2],^k-S+j-l = l,Ifc-l,Qfe = q, Yfe) 

0, if 9s-j = 

1, ifOs-j = 1- 

This is because given ©fe-5, the events {Qk-s+j = 9s-j}, {Ifc-i, Qfe — q, Yfe} are 
conditionally independent. Thus, Eqn. 15 can be written as 



= < 



P(rfe = [q,0] |lfe) 

' l{Qfe=q} ■ P i^k-S = 1 
l{Q.=q} • P i^k-S = 



ifc-i,Qfe = q, Yfe^ , if = 1 

Ifc-i, Qfe = q, Yfe) • (l-py-J-^p, if = [0, ... ,0, 1 



. l{Q,=q}-P(efe_5=0|lfe_i,Qfe = q,Yfe) .(1-p)^ if6> = 
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Define Gfe := Qk-Ak^ define 

*fc := P(efe = f |lfe_i,Qfe = [A,fe,(5,w,r],Yfe^ 

= P{ek-s = 1 |lfe-i,Qfe = [A,fe,(5,w,r],Yfc) 
Hfe := P(efe = f |lfe_i,Qfe = [A,fe,5,w,r],Yfc) 
= P{T ^k\lk-i,Qk = [\b,S,w,r],Yk) . 

Thus, Eqn. 16 can be written as 

P{Tk = [[\b,6,w,r],e] \ h) 

l{Qfc = [A,f),<5,w,r]} ■ *fe, if = 1 

l{Q. = [A,6,5,w,r]} ' (1 - • (1 - p)*"^" V, if = [0, ■ • ■ , 0, ^ 

. l{Q, = [A,6,5,w,r]} • (1 - • (1 - P)^ if = 

We now find a relation between 11^ and in the following Lemma. 



(17) 



18) 



Lemma 1 The relation between the conditional probabilities Ilk and 'i'k is given 
by 

Uk = *fc + (l-*fe)(l-(l-p)') 
Proof. Sec Appendix IV. □ 



(19) 



From Eqn. 18 and Lemma 1, it is clear that a sufficient statistic for Ik is Vk = 
[QfejIIfe]. Also, we show in Appendix - V that Uk can be computed recursively, 
i.e., when Ak = 0, i/k+i = [Qfe+ijlf^+i] = [Qk+i, 4>n{i^k,Zk+i)], and when Ak = 1, 
Vk+i = t, a terminal state. Thus, I'k can be thought of as entering into a terminating 
(absorbing) state t at r (i.e., Vk = [Qk,^k] for fc < r and = t for A; > r). 
Since Uk is sufficient, for every policy fik there corresponds a policy Jik such that 
/*fe(Ife) = V-ki^k) (see page 244, [Bertsekas 2000a]). 

5.6 Optimal Stopping Time t 

Let Q be the set of all possible states of the queueing system, Qfe. Thus the state 
space of the sufficient statistic is = (Q x [0, 1]) U {t}. Recall that the action 
space is ^ = {0, 1}. Define the one-stage cost fimction g : M x M_|_ as follows. 
Let V ^ N he a. state of the system and let a e y4 be a control. Then, 

f if = t 

g{v, a) = < c • TT if 1/ = [q, tt] , a = 
[ 1 — TT if !^ = [q, 7r],a = 1. 

Note from Eqn. 13 for k ^ t that 

E[sffc(Bfe,^fc)] = E[sffe(efc,/ife(Ifc))] 



= E 



gk{Qk, iJ-kiJ-k)) 
vk,V'k{vk))\ 
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and for k > t, 







Since, {uk} is a controlled Markov process, and the one-stage cost function 
the transition probability kernel for Ak = 1 and for Ak = (i.e., P (Zk+i \ i^k))-, 
do not depend on time k, and the optimization problem defined in Eqn. 14 is over 
infinite horizon, it is sufficient to look for an optimal policy in the space of stationary 
Markov policies (see page 83, [Bertsekas 2000b]). Thus, the optimization problem 
defined in Eqn. 14 can be written as 



C(c,T*) = min^E [5(z/fe,/xfe(zyfe))] 

fe=0 

oo 

= 51 ^ [9{^k,Ji*{t^k))] ■ 



k=0 



Thus, the optimal total cost is given by 



^*([qo,^o]) = 



k=0 



g{uk,n*{Pk)) 



^0 = [qo,7i"o] 



(20) 



(21) 



The solution to the above problem is obtained following the Bellman's equation, 



^*([q,^]) 



min I 1 — TT, CTT + E 



J* (Qfc+i, 0n(j^fc, Zfc+i)) 



(22) 



where the function (f)n{^'k, Zfe+i) is provided in Appendix - V. 



Remark 5.1 The optimal stationary Markov policy (i.e., the optimum stopping 
rule r) in general depends on Q. Hence, the decision delay and the queueing delay 

are coupled, unlike in the NODM case. 

We characterize the optimal policy in the following theorem. 

Theorem 3 The optimal stopping rule r* is a network state dependent threshold 
rule on the a posteriori probability Ilk, i-^-, there exist thresholds 7(q) such that 



T = inf{fc ^ : Hfe ^ 7(Qfc)} 
Proof. See Appendix- VI. □ 



(23) 



In general, the thresholds 7(Qfe)s (i.e., optimum policy) can be numerically ob- 
tained by solving Eqn. 22 using value iteration method (see pp. 88-90, [Bertsekas 
2000b]). However, computing the optimal policy for the NADM procedure is hard 
as the state space is huge even for moderate values of A''. Hence, we resort to a 
suboptimal policy based on the following threshold rule, which is motivated by the 
structure of the optimal policy. 

T = inf{/e > : Hfe > 7} (24) 
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where 7 is chosen such that P {t <T) = a is met. 

Thus, we have formulated a sequential change detection problem when the sensor 
observations arc sent to the decision maker over a random access network, and 
the fusion center processes the samples in the NADM mode. The information for 
decision making now needs to include the network state Qk (in addition to the 
samples received by the decision maker); we have shown that [Qfe,nfc] is sufficient 
for the information history 1^. Also, we have provided the structure for the optimal 
policy. Since, obtaining the optimal policy is computationally hard, we gave a 
simple threshold based policy, which is motivated by the structure of the optimal 
policy. 

6. NUMERICAL RESULTS 

Minimizing the mean detection delay not only requires an optimal decision rule at 
the fusion center but also involves choosing the optimal values of the sampling rate 
r, and the number of sensors N. To explore this, wc obtain the minimum decision 
delay for each value of the sampling rate r numerically, and the network delay via 
simulation. 

6.1 Optimal Sampling Rate 

Consider a sensor network with N nodes. For a given probability of false alarm, 
the decision delay (detection delay without the network delay component) decreases 
with increase in sampling rate. This is due to the increase in the number of samples 
that the fusion center receives within a given time. But, as the sampling rate 
increases, the network delay increases due to the increased packet communication 
load in the network. Therefore it is natural to expect the existence of a sampling 
rate r*, with r* < <t/N, (the sampling rate should be less than a/N, for the 
queues to be stable; see Theorem 2) that optimizes the tradeoff between these two 
components of detection delay. Such an r* , in the case of NODM can be obtained 
by minimizing the following expression over r (recall Theorem 1). 



{d{r) + l{r)) (1 - a) - p • l{r) + i min E 



K -K 
K-K 



1 + 

also depends 



Note that in the above expression, the delay term minn„ E 

on the sampling rate r via the probability of change Pr = 1 — {1 — p)*^^/'"-' . The delay 
due to coarse sampling l{r){l — a) — p-l{r) can be found analytically (see Appendix - 

I). We can approximate the delay minn„ E K — K by the asymptotic (as a — > 0) 

delay as jvj(/i /o')+"in(i-p )| where /(/i, /o) is the Kullback-Leibler (KL) divergence 
between the pdfs /i and /o (see [Tartakovsky and Veeravalli 2005]). But, obtaining 
the network delay (i.e., d{r){l — a)) analytically is hard, and hence an analytical 
characterisation of r* appears intractable. Hence, we have resorted to numerical 
evaluation. 

The distribution of sensor observations are taken to be A/'(0, 1) and A/'(l,l)^, 
before and after the change respectively for all the 10 nodes. We choose the prob- 



* As usual, J\f{a, v) denotes a normal distribution with mean a and variance v 
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60 ^ ' ' ' ' ^ 

0.01 0.015 0.02 0.025 0.03 0.035 



Sampling rate, r (samples per slot) 



Fig. 15. Mean detection delay for = 10 nodes is plotted against the sampling rate r for both 
NODM and NADM (defined in Eqn. 24). For NODM, an approximate analysis is also plotted. 
This was obtained with the prior probability p = 0, p = 0.0005, probability of false alarm target 
a = 0.01, (T = 0.3636 and with the sensor observations being A''{0, 1) and A/'(l,l), before and 
after the change respectively. 



ability of occurrence of change in a slot to be p = 0.0005, i.e., the mean time 

+ 



until change is 2000 slots. minn„ E 



K ~ K 



and d{r) are obtained from simu- 



lation for a = 0.01 and a = 0.3636 and the expression for mean detection delay 
(displayed above) is plotted against r in Figure 15. Note that both NODM and 
NADM are threshold based, and we obtain the corresponding thresholds for a tar- 
get PpA — 0.01 by simulation. These thresholds are then used to obtain the mean 
detection delay by simulation. In Figure 15, we also plot the approximate mean 
detection delay which is obtained through the expression for l(r) and the approx- 



imation, minn„ E 



K -K 



\Ma)\ 



We study this approximation 



~" W/(/l,/o) + |ln(l-p,)| 

as this provides an (approximate) explicit expression for the mean decision delay. 
The delay in the FJQ-GPS does not have a closed form expression. Hence, we still 
need simulation for the delay due to queueing network. It is to be noted that at 
k = 0, the size of all the queues is set to 0. The mean detection delay due to the 
procedure defined in Eqn. 24 is also plotted in Figure 15. 

As would have been expected, we see from Figure 15 that the NADM procedure 
has a better mean detection delay performance than the NODM procedure. Note 
that a/N = 0.03636 and hence for the queues to be stable (see Theorem 2), the 
sampling rate has to be less that a/N = 0.03636 (1/28 < 0.03636 < 1/27). As 
the sampling rate r increases to 1/28 (the maximum allowed sampling rate), the 
queueing delay increases rapidly. This is evident from Figure 15. Also, we see from 
Figure 15 that operating at a sampling rate around l/34(Ri 0.0294) samples/slot 
would be optimum. The optimal sampling rate is found to be approximately the 
same for NODM and NADM. At the optimal sampling rate the mean detection delay 
of NODM is 90 slots and that of NADM is 73 slots. 
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Fig. 16. Mean decision delay of NODM procedure for N X r = 1/3 is plotted against the the 
number of nodes A'^. The plot is obtained with p = 0, p = 0.0005, a = 0.01 and with the sensor 
observations being A/'(0, 1) and Af{l, 1), before and after the change respectively. The components 
of the mean decision delay, i.e., the coarse sampling delay (1 — a)l{r) — pl{r), and the decision 

maker delay, — minn^^ E \k — k\ are shown on the right. 



6.2 Optimal Number of Sensor Nodes (Fixed Observation Rate) 

Now let us consider fixing N x r. This is the number of observations the fusion 
center receives per slot in a network with N nodes sampling at a rate r (samples per 
slot). It is also a measure of the energy spent by the network per slot. Since it has 
been assumed that the observations are conditionally independent and identically 
distributed across the sensors and over time, it is natural to ask how beneficial it 
is to have more nodes sampling at a lower rate, when compared to fewer nodes 
sampling at a higher rate with the number of observations per slot being the same. 
With p = 0.0005, a = 0.01, and a = 0.3636, and fo - J\f{0, 1) and /i - A/'(l, 1), 
we present simulation results for two examples, the first one being Nr =1/3 (the 
case of heavily loaded network) and the second one being Nr = 1/100 (the case of 
lightly loaded network, Nr ^ a). 

Figure 16 shows the plot of mean decision delay, l{r){l—a~p)+^ minn^ E K — K 

versus the number of sensors when Nr — 1/3. As N increases, the sampling rate 
r = l/(3iV) decreases and hence the coarse sampling delay l{r){l — a) increases; this 
can be seem to be approximately linear by analysis of the expression for l{r) given in 
Appendix - I. Also, as N increases, the decision maker gets more samples at the de- 
cision instants and hence the delay due to the decision maker ^ minn^ E K ~~ K 

decreases (this is evident from the right side of Figure 16). Figure 16 shows that 
in the region where N is large (i.e., N ^ 20) or N is very small (i.e., iV < 5), as 
N increases, the mean decision delay increases. This is because in this region as N 
increases, the decrease in the delay due to decision maker is smaller compared to 
the increase in the delay due to coarse sampling. However, in the region where N 
is moderate (i.e., for 5 ^ TV < 20), as N increases, the decrease in the delay due to 
decision maker is large compared to the increase in the delay due to coarse sam- 
pling. Hence in this region, the mean decision delay decreases with N . Therefore, 
we infer that when N x r = ^, deploying 20 nodes sampling at 1/60 samples per 
slot is optimal, when there is no network delay. 
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Number of Sensors, N 

Fig. 17. Mean detection delay for N X r = 1/3 is plotted against the number of nodes N. This 
was obtained with p = 0, p = 0.0005, a = 0.01 cr = 0.3636 and with the sensor observations being 
Af{0, 1) and J\f{l, 1), before and after the change respectively. 

Figure 17 shows the mean detection delay (i.e., the network delay plus the decision 
delay shown in Figure 16) versus the number of nodes N for a fixed N x r = 1/3. 
As the the number of nodes N increases, the sampling rate r = l/(3iV) decreases. 
For large N (and equivalently small r), in the case of NODM with the Shiryaev 
procedure, the network delay, d{r) « ^ as it requires N (independent) successes, 
each with probability cr, in the random access network to transport a batch of N 
samples (also, since the sampling rate r is small, one would expect that a batch 
is delivered before a new batch is generated) and the decision maker requires just 
one batch of N samples to stop (after the change occurs). Hence, for large N, the 
detection delay is w lir){l — a) + d{r){l — a) « l{r){l ~ a) + ^{1 — a). It is to be 
noted that for large N, to achieve a false alarm probability of a, the decision maker 
requires Na < N samples (the mean of the log-likelihood ratio, LLR of received 
samples, after change, is the KL divergence between pdfs /i and /o, given by 
/(/i, /o) > 0. Hence, the posterior probability, which is a function of LLR, increases 
with the the number of received samples. Thus, to cross a threshold of 7(a), we 
need iV„ samples). Thus, for large N, in the NADM procedure, the detection delay 
is approximately ?(f)(l — a) + -^(1 — a), where Na/a is the mean network-delay 
to transport Na samples. Thus, for large iV, the difference in the mean detection 
delay between NODM and NADM procedures is approximately ^-^{N ~ No)- Note 
that Na depends only on a and hence the quantity ^-^{N — Na) increases with N. 
This behaviour is in agreement with Figure 17. Also, as iV x r = 1/3, we expect 
the network delay to be very large (as 1/3 is close to cr = 0.3636) and hence having 
a single node is optimal which is also evident from Figure 17. 

It is also possible to find an example where the optimal number of nodes is 
greater than 1. For example this occurs in the above setting for N x r = 0.01 (see 
Figure 18). Note that having = 10 sensors is optimal for the NADM procedure. 
The NODM procedure makes the decision only when it receives a batch of A^ samples 
corresponding to a sampling instant, whereas NADM procedure makes the decision 
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Fig. 18. Mean detection delay for N xr = 0.01 is plotted against the the number of nodes N. This 
was obtained with p = 0, p = 0.0005, a = 0.01 and with the sensor observations being A^(0, 1) 
and J\f{l, 1), before and after the change respectively. 



at every time slot irrespective of whether it receives a sample in that time slot or 
not. Thus, the Bayesian update that NADM does at every time slot makes it stop 
earlier than NODM. 



7. CONCLUSIONS 

In this work we have considered the problem of minimizing the mean detection 
delay in an event detection on a small extent ad hoc wireless sensor network. We 
provide two ways of processing samples in the fusion center: i) Network Oblivious 
(NODM) processing, and ii) Network Aware (NADM) processing. We show that 
in the NODM processing, under periodic sampling, the detection delay decouples 
into decision and network delays. An important implication of this is that an 
optimal sequential change detection algorithm can be used in the decision device 
independently of the random access network. We also formulate and solve the 
change detection problem in the NADM setting in which case the optimal decision 
maker needs to use the network state in its optimal stopping rule. Also, we study 
the network delay involved in this problem and show that it is important to operate 
at a particular sampling rate to achieve the minimum detection delay. 



Appendix - I 

Proof: (Theorem 1) 



min E 

n„ 



min E 

n„ 



min s E 

He 



{U -T + T 
{U-T)I^f 



K K 

r r -f^^^J 
K 
r 



T I 



{T^T} 



(25) 
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Note that in Eqn. 25, the first term is the queueing delay, the second term is the 
coarse sampling delay and the third term is the decision delay (all delays being in 
slots). Consider the first term. 



Yl P(T^j,K = b,Di, = x)x-I^!,^^y 

j>0,b>0,x>0 

= Yl P{T = j,K = b)p{D, = x)x.I^,^,y 

j>0,6>0,x>0 

where we have used the facts that (i) the decision process is based on only what 
the packets carry and not on their arrival time etc, and (ii) the condition that 
sampling is done periodically at a known rate r. Assuming the queueing system to 
be stationary, the above can be written as 



= {Y.P{D = x)x]Y,p(T = j,K = b)I^,^,y 

\x>0 J j,b 

= E[D] p ^ . 



Note that E[D] is a function of the sampling rate r, and does not depend on the 
detection policy. 

Consider the second term of Eqn. 25, 



--T\I~ 







[(f 





















'{K>K} 



'{K>K,So = l} 

For 5*0 = 1, we have T = and K = 0. Hence, 



^-T]I~ 

r I {K>K,So=0} 



K 

y - ^ j ^{T>T} 



= + Eo 



K 



where Eq [•] denote the expectation and Pq (•) the probability law, when the initial 
state is 5*0 = 0. Now, 



Eo 



r 



{K>K} 



h/r 



EE E P.{T = t,K = b,K=t).il^t 



b=l b=b t=(6-l)/r+l 

oo oo 

EEPo {K = b,K = b) 

1>=1 b=b 
b/r 

Pq{t = t\ K = b,K = b 

_t=(6-l)/r+l 
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We note that K is independent of T given K. Hence, 



-0 



We have 



oo oo 

EEPo {K = b,K = b) 

b=i b=b 

■[ E yPo(T=l-y\K = b 



PoiT = t\K = b) = \ it-;lt-SX ^ forts,. b=\t.r] 



0, 



Hence, for < y < 1/r — 1, 



Po\T=--y\K = b 
But, {1-Pr) = (1 -p)^/''. Hence, 



Po{T=---y\K = b 



otherwise. 



(1- 




-y-ip 


(1 


-Pr)"- 


~^Pr 


(1- 


p)b/r- 




(1 


-Prf- 




(1- 


pY/r- 


-y-ip 



l-(l-p)iA 



It can be shown that 



l-(l-p)V'- 



r 

■■ l{r) 



P rpr 



Therefore, Eqn. 26 can be written as 



l{r) ■Po(^K>K^ = l{r) • (p (if > if) - 

= i{r) • (i - p < a:) - p) 

Finally, we have 

rninE[(C/-r)/^^>^j 

= min |(i(r) (l - P (t < t)) + l{r)Po (t > t) + ^e[(K - | 
= niin|(rf(r)+/(r)) (l - P (t < t)) - p ■ l{r) + ^e[{K - K)+'^^ 
Note that, in the above equation, the first term {d{r) + l{r)) IS 
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minimum 



when p(t <T^ =a. It follows that 



min E 

He 



iU-T)L^ 



{T>T} 



> (d(r) + l{r)) (1 - a) - p • Z(r) + - min E \^[K - K)+ 



Also, since the optimal policy for the problem minn„ E 
a, we also have 

(d(r) + l{r)) (1 - a) - p • l{r) + J min E {K - K)+ 

It follows that 



{K - K)+ achieves (l - P (t < t)) 

{U — T)I^f ^j,y 



^ min E 

Ha 



min E 

n„ 



{U - T)I^f^^y\ = (rf(r) + Z(r)) (1 - a) - p ■ /(r) + - min E [(if - /^)+ 



We need 1 — a > p or a < 1 — p. If a > 1 — p, the optimal stopping is at i = 0. 
This will yield the desired probability of false alarm and E {U — T)I^f^j,y =0. □ 



Appendix - II 

Proof: (Theorem 2) The necessity of Nr < cr is clear. The sufficiency proof goes as 
follows. Consider the FJQ-GPS system with every queue always containing a single 
dummy packet that is served at low priority. Let us call this the saturated FJQ-GPS 
system. When a queue becomes empty, the low priority dummy packet contends 
for service. If it receives service, then it immediately reappears and continues 
to contend for service. If, while a dummy packet is in service, a regular packet 
arrives, then the service of the dummy packet is preempted and the regular packet 
starts contending. It follows that the service rate applied to every queue (i.e., those 
with regular packets or those with dummy packets) is always a/N. Now, consider a 
virtual service process of rate a. In each slot, a service occurs with probability a and 
the service is applied to any one of the queues with equal probability. Equivalently 
each queue is served by an independent Bernoulli process of rate a/N. Considering 
only the services to the regular packets at each queue, we have a GI/M/l queue 
(here GI refers to a General distrihuiion with Independent arrivals, M refers to 
a Ma,rkovia,n service process and 1 refers to one server). Hence, the system has 
proper stationary delay, iff r < a/N. Also, it can be seen that the delays in the 
above described system (with dummy packets when a queue is empty) upper bound 
those in the original FJQ-GPS system. Hence, the result follows. □ 



Appendix - III 

Distribution of state noise N 
Let q = [A, b, 5, w, r]. Note that P (Mfc = m | Qfe = q, 0fe = 0) = P (Mfc = m | Qfe = q) 
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and is given by 

P (Mfe = I Qfe = q) 
P (Mfc = TO I Qfe = q) 

where 07v(q) and (/)^(™) (q) arc obtained from Eqns. 9 and 8. 

The distribution function, P (O/c = o | = q, 0^ = 6) = P {Ok = o | = q, = 0) 
is given by 



Jl if0^(q) = O 

\l-cj if <A^(q) > 

ro if<^^(q) = o 

if</.iw(q)>0, TO=1,2,3,.-. ,iV. 



P(Ofc = o|Qfc =q,efe = 0) = 
P(Ofc = o|Qfc = q,efc = l) =■ 



l-p 


if = 


p 


if o== 1, 





otherwise. 


1 


if = 





otherwise. 



Appendix - IV 

Proof of Lemma 1 

Let q = [A, b, 6, w, r]. From Eqn. 17, 

Hfc := P(T^fc|lfc_i,Qfc=q,Yfc) 

= P(r^A;-5|lfe_i,Qfc = q,Yfe) + P (fc - 5 < T ^ fc | , = q, Yfe) 

= P (T ^ fc - 5 I Qfe = q, Yfe) + P (T > - 5 I Ifc_i, Qfe = q, Yfe) • P (T < fc | T > - <5,Ifc-i, Qfe = q, Yfc) 

= *fc + (1 - *fe) ■ P (T < fc I T > fc - 5, Ife_i, Qfe = q, Yfe) , 

P (fc - <5 < T ^ fc) P , Qfe = q, Yfc I fe - 5 < T ^ fe) 



*fe + (l-*fe)- 



P (T > fc - 5) P (Ife_i , Qfc = q, Yfc I T > fe - 5) 



^, + (l-^,).P(^-^<^^^) (27) 

*fc + (l-*fc)(l-(l-p)'') (28) 
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Eqn. 27 is justified as follows. Note that 
P{lk-i,Qk^ci,Yk\k~d <T ^k) 

= P (Q[0:fe-l],Qfc - q,X[i.B,^l],{^g : R'k^ = l},U[0:k-l] \ k - S < T ^ 

= P(,Q[0:k-i],Qk = q\k-S <T ^k) 

■P (X[i.B,-i], {Xg :R^^^ = l}\k-6<T^k, Q[o:fc_i], Qfe = q) 

•P («[o:fc-i] \k-5<T^k, Q[o:fe-i], Qfc = q,X[i.B,-i], {^i'i : = 1}) 

= P (Q[0:fc-i], Qfc = q) • P (X[i,B,_i], {Xg : iif = 1} I fc - <5 < T, Qio:k-i],Qk = q) 

•P {u[o:k-i] I Q[0:fe-i],Qfe = q,X[i,B.-il,{^i'^ : = 1}) 
= P (Q[0:fc-i], Qfe = q\T>k-6)-P (X[i^B,_i], {Xg : = 1} I T > fc - (5, Q[o.fc-i], Qfc = q) 

•P (^[0:^-1] \\T>k-S, Q[0:fe-1], Qfe = q,X[i.B,_i], {^S " ^k^ = ^l) 

= P(lfe_i,Qfe = q,Yfc|T> 

We use the following facts in the above justification: i) the evolution of the queucing 
system is independent of the change point T, ii) whenever T > k — 6, the 
distribution of any sample xj^\ h ^ Bk is fo, and iii) the control Uk = A(I/s)- D 

Appendix - V 

Recursive computation ofllk 

At time k, based on the index of the node that successfully transmits a packet 
Mfe, the set of all sample paths Q can be partitioned based on the following events, 

£i,k := {w : Mfe(w) = or Mk{uj) = j > 0, R^i\oj) = l} 

|w : Mk{uj)=j > = 0,Y,R^k\^) <N-lj 

i.e., = fi^/c U f2,fe U f3,fc. We note that the above events can also be described by 
using Qfe and Qfe+i in the following manner 

Si,k = {w : Wfe+i(a;) = Wfe(a;). Rfc+i(u;) = Rfe(w)} 

[j{u : Wk+i{uj) = Wk{uj) + e,,Rk+i{uj) = Rfc(w)} 

£2,k = {w : Wfe+i(w) = Wfe(w),Rfe+i(w) = Rfc(w) +6^} 



--2,fe 



'3,fe 



AT 



£s,k = u; :Y.R):\u;) = N- l,\/i,W^>_,{u;) = {wt'\uj) - = l{iy«>o} 

I i=l 

Here, the events Si^k and £2,k represent the case Bk+i = B^, and the event £3^k 
represents the case B^+i = B). + 1 (i.e., only if the event £3^k occurs then the 
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batch index is incremented). We are interested in obtaining Ilfe+i from [Q/cH^] 
and Zfc_|_i. We show that at time k + 1, the statistic ^k+i (after having observed 
Zfe+i) can be computed in a recursive manner using and Qfe. Using Lemma 1 
(using Eqn. 19) we compute n^+i from \E'ft+i. 

= P (Ok+i = 1 I Ik+i) 

3 

= ^ P (©fc+l = l)^c,fc I Ifc+l) 

c=l 
3 

= ^ P (S/s+i = 1 I fc,fe,Ifc+i) Ife.fc (■-■ ^c,fc is Ife+1 measurable) 



c=l 



Case Mfc = or Mfe = j > 0, r[^^ = 1: 

= P (0fe+i = 1 I fi,fe,Ife+i) 

= P (0/c+i = 1 1 £i,k, ife, Qfe+1 = q') 

_ P(Bfe+i = 1 I fi,fe,Ife) ■ /Qfc+i|efc+i,£:i,fc,ifc(q'|l,fi,fe,Ife) 



(by Bayes rule) 



= P (Gfe+i = 1 I £^i,fe,Ife) (Qfe+i is independent of Gfe+i) 

= P (Gfc = 0, Qk+i = 1 I Ifc) + P (Ofe = 1, Gfc+i = 1 I Ifc) 
= (1 - Uk)p + Ilk 

Case Mk=j> 0, r[^'^ = 0, Ylf^i ^k^ <N-l:hi this case, the sample x'^l 
is successfully transmitted and is passed on to the decision maker. The decision 
maker receives just this sample, and computes Hk+i- We compute Vl/fc+i from 
and then we use Lemma 1 (using Eqn. 19) to compute llfc+i from ^'fe+i. 

*fc+i 

= p(Gfc+i = l |£2,fe,Ifc+i) 

= P (Ofc+i = 1 I f2,fe,Ifc,[Qfe+i,Yfe+i] = [q',2/]) 
= P (Gfc = 0,efe+i = 1 I f2,fc,Ife,[Qfc+i,Yfc+i] = [q',2/]) 
+P (Ofe = l,Gfe+i = 1 I f2,fe,Ife, [Qfe+i, Yfc+i] = [q',2/]) 

Since, we consider the case when the fusion center received a sample at time 
k + 1 and Bk+i = Bk, Afe+i = Afe + 1 and hence, the state G^+i = Qk+i-Ak+i = 
Gfe-Afc = Gfc. Thus, in this case, ^'fe+i can be written as 
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(a) 



(b) 



(c) 



P (Ofc = l,Gfc+i = 1 I fz.fc.Ifc, [Qfc+1, Yfc+i] = [q',j/]) 

P (Ofc = l,efc+i = 1 I f2,fc,Ife) ■ P (Qfe+i = q' I ©fc = l,0fc+i = l,£2,fc,Ifc) 

P(Qfc+i = q'|f2,fc,ifc) •/Yfc+i|f2,fc,ijc,Qfe+i(2'l^2,fc,ifc,q') 

P(efe = 1,6^+1 = 1 I £2,k,lk) ■ P{Qfc+i =q'|f2,/c,Ifc) ■/y,,^^|©^{2/ I 1) 

P(Qfc+i = q'|f2,fc,ife) ■ /Yfe+i|£2,fc,ifc,Qfc+i(yl^2,fc,ifc,q') 
p (Ok = 1,6^+1 = 1 1 £2,k,ik) ■ h{y) 



p(efc = I £:2.fe,lfe,Qfe+i) •/Y^^^|e^(y|0) + P(efe = i i £2,fe,lfe,Qfe+i) • /Y^^^|e^(2/|i) 

(J 

(i-*fc)/o(y) + *fc/i(y) 
We explain the steps (a), (6), (c), (d) below, 
(a) By Bayes rule, for events A, i?, C, D, i?, we have 

P (AS I CD) P (i; I ABCD) P (i^ I ABODE) 



P (AS I CilEF) 



P {E I C£>) P (F I CDE) 



(b) Qfe+i is independent of Ofe, Ofc+i. Also, given 9^, Yfc_|_i is independent of 

0fe+l, ^2,fe) Ifc, Qfe+1 

(c) For any events A, B, and a continuous random variable Y, the conditional 
density function fY\Aiy\A) = P {B \ A) fY\AB{y\AB)+P (S'= | A) fY\ABr^{y\AB'') 
Also, given 6fe, Y^+i is independent of £2,fe,Ife, Qfe+i 

(d) £2,k is [Ife, Qfe+i] measurable, and hence, given [Ife, Qfe+i], ©fe is independent 

of £2M- 

• Case Mk=j> 0, r[^'^ = 0, J2^^i = iV - 1: In this case, at time /c + 1, the 
decision maker receives the last sample of batch Bk, X^^ (that is successfully 
transmitted during slot k) and the samples of batch Bk + 1, if any, that are 
queued in the sequencer buffer. We compute ^fe+i from \E'fe and then we use 
Lemma 1 (using Eqn. 19) to compute Ilfe+i from ^fe+i. In this case, the decision 
maker receives n := J2iLi '^{w'-*^>o} samples of batch Bk + 1. Also, note that n 
is Ife measurable. 



*fe+l = P [Qk+l = 1 I f3,fe,Ife+l^ 

= P (Ofc+i = 1 I £s,k,lk, [Qfc+i, Yfe+i] = [q',y]) 

= P (Ofe = 0,6^+1 = 1 I £3,k,lk, [Qfe+i, Yfc+i] = [q',y]) 

+P (Ofe = 1,6^+1 = 1 I £3,kJk, [Qfe+i, Yfe+i] = [q',y]) 

Since, we consider the case Bk+i = Bk + 1, Afe+i =Afe + l — 1/r and hence, the 
state 6fe+i = 6fe+i_Afc+i = 6fe_Afe+i/r- 
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Let y = [j/o, 2/1, • • • , Vn]- Consider 

p(efe = e,efc+i = 1 1 ei,k,ik,lQk+i,Yk+i] = [q',y]) 
(a) p(0fc = e,Gfc+i = 1 1 £-3,fe,ife) ■ p (Qfc+1 = q' I efe = e,efe+i = i,S3,k,ik) 
P(Qfe+i = q'l^^3,fc, ifc) • /yj^+i |£3,fc,ifc,Qfc+i (yi^^s.fc, ifc, q') 



p (Ofc = e,0k+i = 1 1 f3,fc,ifc) ■ P{Qfc+i = q'|f3,fc,ifc) • 4(2/o)nr=i/i(2/i) 



P(Qfc+i = q'|f3,fc,ifc) ■ /Yfc+i|£:3_fc,ifc,Qfe+i(y|^^3,fc,ifc,q') 

p (Ofc = e I f3,fc,ifc) • P (Ofc+i = 1 1 = 0,£:3,fc,ifc) • feivo) niLi 

We explain the steps (a), (6), (c) below, 
(a) By Bayes rule, for events A, B, C, D, E, F, we have 

P {AB I CD) P {E I ABCD) P {F \ ABODE) 



P [AB I CDEF) 



P (i; I CD) P (i^ I CDE) 



(b) Qfe+i is independent of O^, Qk+i- Also, given 0^, Yfe+i,o is independent of 

0fc+i, ^s.fc, Ifc, Qfc+i, and given Qk+i, ^k+i,i is independent of Ofc, £z^k, Ife, Qfe+i- 
It is to be noted that given the state of nature, the sensor measurements 
^fc+i,0)^fc+i,i, ■ • • , Yk+i.n are conditionally independent. 

(c) For any events A, B, and a continuous random variable Y, the conditional 
density function = P {B \ A) fYiABiy\AB)+P {B- \ A) fY\ABMAB-)- 
Also, given Ofc, Yk+i is independent of £3,fc,Ifc, Q^+i 

It is to be noted that the event £^^k is [Ifc,Qfc+i] measurable, and hence, given 
[Ifc)Qfc+i]) 0fc is independent of i?3,fe. Thus, in this case, 

(1 - *fe)pr /o(2/o)nLi hivi) + *fe/i(2/o)nr=i/i(2/i) 



^k+1 



(1 - f fc)(i - Pr)Myo) nr=i Mvi) + (1 - *fc)Pr/o(2/o) nr=i /i(2/i) + */=/i(!/o) nr=i 

Thus, using Lemma 1 (using Eqn. 19), we have 

Hfe+i = *fc+i + (l-*fe+i)(l-(l-p)^'=+0 

=: ^vtl^-fe, Zfc+i) + (1 - 0*(*fe, Zfe+i)) (1 - (1 - 

/ nfc-(i-(i-p)^^) 
= (rr^)A; 'Zfc+i 

'Hfe - (l-(l -p)A.) 

(i-p) 

^'n ([Qfe, life], Z/c+i) . 



Appendix - VI 

Structure of t* We use the following Lemma to show that J* (q, tt) is concave in it. 
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Lemma 2 // / : [0, 1] — >■ R is concave, then the function /i : [0, 1] ^ M defined by 

y ■ (f)2{x.) + (1 - y)Pr ■ (f)i{x.) 



h{y) = ^<t>(^ 



f 



y ■ ^2(x) + (1 - y)Pr ■ (f'li^) + (1 - y){l-Pr) ■ 0o(x) 



is concave for each X, where (j){x) = y-(l)2{x) + {l — y)pr-4>i{x) + {l — y){l—pr)-4>o{^), 
< Pr < 1, and ^o(x), ^i{x), and <^2(x) are pdfs on X. 

Proof. See Appendix - I of [Premkumar and Kumar 2008]. □ 

Note that in the finite i?-horizon (truncated version of Eqn. 21), we note from value 
iteration that the cost-to-go function, for a given q, J^([q, tt]) = 1 — tt is concave 
in TT. Hence, by Lemma 2, we see that for any given q, the cost-to-go functions 
^])' •^iF-2([Q' ■ ' ■ ' "^o^dlj^]) concave in tt. Hence for < A < 1, 

J*([q,7r])= hm Jo^([q,7r]) 

J*([q, Atti + (1 - A)7r2]) = Um ([d, Atti + (1 - X)w2]) 

> Um AJo^([q,7ri]) + Um (1 - A) Jo^([q, tts]) 

= AJ*([q,7ri]) + (l-A)J*([q,7r2]) 

It follows that for any given q, J*([q, tt]) is concave in tt. □ 

Define the map ^ : Qx [0, 1] K+ as ^([q, tt]) := I-tt and the map k : Qx [0, 1] -5- 



l+, as K([q, tt]) := c-tt+Aj. ([q, tt]) = c-tt+E 



J* ([Qfe+i,0n(t'fe,Zfe+i)]) 



Note that ^([q, 1]) = 0, K([q, 1]) = c, ^([q, 0]) = 1 and 



z^fe = [q, tt] 



K([q,0]) = E 

(2) ,- 



i^k = [q, 0] 



N 



= EE 

m=0 

(4) 

N 



J* {[Qk+i,4>n{i'k, Zfc+i)]) 

J* {[<pQ{Qk, Mfe), M'^k, Zfe+i)]) 

J* ([0Q(q>w),(/>n(i^fc,Zfe+i)]) 



0Q(q,w),E 



t'fe = [q, 0] 

Mk = m,Vk = [q, 0] 
Mk = m,Vk = [q, 0] 



P Mfc = TO 



P Mfe = TO 



[q,o]j 

= [q,o]' 



= ^J*([0Q(q,m),p)P Mfe 



m=0 

(6) ^ 

< ^(i-p)-p(a4 

m=0 
= l-p < 1 



TO 



i^fc = [q, 0] 



I'fe = [q, 0] 



where in the above derivation, we use the evolution of Qfc in step 2, the Jensen's 
inequality (as for any given q, J* (q, tt) is concave in tt) in step 4, and the inequality 
J*(q, tt) < 1 — tt in step 6. 
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Note that K([q, 1]) - ^([q, 1]) > and K([q, 0]) - ^([q, 0]) < 0. Also, for a fixed q, 
the function K;([q, tt]) — C([q, tt]) is concave in tt. Hence, by the intermediate value 
theorem, for a fixed q, there exists 7(q) G [0, 1] such that K([q, 7]) = ^([cIjT])- This 
7 is unique as ^([q, tt]) = C([q, tt]) for at most two values of tt. If in the interval 
[0,1], there are two distinct values of n for which /t([q, tt]) = ^([q, tt]), then the 
signs of K([q, 0]) — ^([q, 0]) and K([q, 1]) — ^([q, 1]) should be the same. Hence, 

T* =inf{fc:nfc >7(Qfe)} 

where the threshold 7(q) is given by c • 7(q) + Aj» ([q, 7(q)]) = 1 — 7(q)- □ 

REFERENCES 

Aldosari, S. a. and Moura, J. M. F. 2004. Detection in decentralized sensor networks. In 
Proceedings of ICASSP. 11:277-280. 

Baccelli, F. and Makowski, a. 1990. Synchronization in qucucing systems. In Stochastic 
Analysis of Computer and Communication Systems, H. Taltagi, Ed. North-Holland, 57—131. 

Bertsekas, D. p. 2000a. Dynamic Programming and Optimal Control, Second ed. Vol. I. Athena 
Scientific. 

Bertsekas, D. P. 2000b. Dynamic Programming and Optimal Control, Second ed. Vol. II. 
Athena Scientific. 

Honeywell Inc . http://hpsweb.honeywell.com/Cultures/en-US/Products/Wireless/ 

SecondGenerationWlreless/def ault .htm. 
ISA . http : //www. isa. org/Content/NavlgationMenu/Technlcal_Inf ormatlon/ASCI/ISA100_ 

Wireless_Compliance_Institute/ISA100_Wireless_Compliance_Institute .htm. 
Kumar, A., Manjunath, D., and Kuri, J. 2004. Communication Networking: An Analytical 

Approach. Morgan-Kaufmann (an imprint of Elsevier) , San Francisco. 
Kumar, A., Manjunath, D., and Kuri, J. 2008. Wireless Networking. Morgan-Kaufmann (an 

imprint of Elsevier), San Francisco. 
Niu, R. and Varshney, P. K. 2005. Distributed detection and fusion in a large wireless sensor 

network of random size. EURASIP Journal on Wireless Communications and Networking 4, 7, 

462-472. 

Premkumar, K. and Kumar, A. 2008. Optimal sleep-waJje scheduling for quickest intrusion 
detection using wireless sensor networks. In Proc. IEEE Infocom. AZ, USA. 

Premkum.'VR, K., Kum.'VR, A., and Kuri, .1. 2009. Distributed detection and localization of 
events in large ad hoc wireless sensor networks. In Proc. 47*'' Annual Allerton Conference on 
Communication, Control, and Computing. IL, USA. 

Rajasegarar, S., Leckie, C, and Palaniswami, M. 2008. Anomaly detection in wireless sensor 
networks. IEEE Wireless Communications 15, 4 (August), 34-40. 

Shiryaev, a. N. 1978. Optimal Stopping Rules. Springer, New York. 

Singh, C. K., Kumar, A., and Ameer, P. M. 2008. Performance evaluation of an IEEE 802.15.4 
sensor network with star topology. Wireless Networks 14, 4 (August), 543-568. 

Tartakovsky. A. G. .vnd Veeravalli, V. V. 2005. General asymptotic bayesian theory of 
quickest cliaugc detection. SIAM Tfieory of Probability and its Applications 4-9, 3, 458-497. 

Tenny, R. R. and Sandell, N. R. 1981. Detection with distributed sensors. IEEE Transactions 
on Aerospace and Electronic Systems 17, 501-510. 

Veeravalli, V. V. 2001. Decentralized quickest change detection. IEEE Transactions on Infor- 
mation theory 47, 4 (May), 1657-1665. 

Received March 2009; revised December 2009 and June 2010; accepted Month Year 



ACM Transactions on Sensor Networks, Vol. V, No. N, Month 20YY. 



